Home Catalogue search

eng

Refine your search:

Search in the Catalogues and Directories






	Sort by
Simple Search

Page: 1 2

Hits 1 – 20 of 30

1	Shapley Idioms: Analysing BERT Sentence Embeddings for General Idiom Token Identification
	Nedumpozhimana, Vasudevan; Klubička, Filip; Kelleher, John D.
	In: Front Artif Intell (2022)
	BASE
	Show details

2	Semantic Relatedness and Taxonomic Word Embeddings ...
	Kacmajor, Magdalena; Kelleher, John D.; Klubicka, Filip. - : arXiv, 2020
	BASE
	Show details

3	English WordNet Taxonomic Random Walk Pseudo-Corpora
	Klubicka, Filip; Maldonado, Alfredo; Mahalunkar, Abhijit...
	In: Conference papers (2020)
	BASE
	Show details

4	Language related issues for machine translation between closely related south Slavic languages
	Arcan, Mihael; Klubicka, Filip; Popovic, Maja. - : The COLING 2016 Organizing Committee, 2019
	BASE
	Show details

5	Synthetic, Yet Natural: Properties of WordNet Random Walk Corpora and the impact of rare words on embedding performance
	Klubicka, Filip; Mahalunkar, Abhijit; Maldonado, Alfredo; Kelleher, John D.
	In: Conference papers (2019)
	Abstract: Creating word embeddings that reflect semantic relationships encoded in lexical knowledge resources is an open challenge. One approach is to use a random walk over a knowledge graph to generate a pseudo-corpus and use this corpus to train embeddings. However, the effect of the shape of the knowledge graph on the generated pseudo-corpora, and on the resulting word embeddings, has not been studied. To explore this, we use English WordNet, constrained to the taxonomic (tree-like) portion of the graph, as a case study. We investigate the properties of the generated pseudo-corpora, and their impact on the resulting embeddings. We find that the distributions in the psuedo-corpora exhibit properties found in natural corpora, such as Zipf’s and Heaps’ law, and also ob- serve that the proportion of rare words in a pseudo-corpus affects the performance of its embeddings on word similarity.
	Keyword: Artificial Intelligence and Robotics; Computational Linguistics; corpus; evaluation; Numerical Analysis and Scientific Computing; random walk; representations; Software Engineering; taxonomy; word embeddings; word similarity; WordNet
	URL: https://arrow.tudublin.ie/scschcomcon/271 https://arrow.tudublin.ie/cgi/viewcontent.cgi?article=1283&context=scschcomcon
	BASE
	Hide details

6	Size Matters: The Impact of Training Size in Taxonomically-Enriched Word Embeddings
	Maldonado, Alfredo; Klubicka, Filip; Kelleher, John D.
	In: Articles (2019)
	BASE
	Show details

7	Training corpus hr500k 1.0
	Ljubešić, Nikola; Agić, Željko; Klubička, Filip. - : Jožef Stefan Institute, 2018
	BASE
	Show details

8	Quantitative Fine-Grained Human Evaluation of Machine Translation Systems: a Case Study on English to Croatian ...
	Klubička, Filip; Toral, Antonio; Sánchez-Cartagena, Víctor M.. - : arXiv, 2018
	BASE
	Show details

9	Is it worth it? Budget-related evaluation metrics for model selection ...
	Klubička, Filip; Salton, Giancarlo D.; Kelleher, John D.. - : arXiv, 2018
	BASE
	Show details

10	Quantitative Fine-grained Human Evaluation of Machine Translation Systems: a Case Study on English to Croatian
	Sanchez-Cartagena, Victor Manuel; Toral, Antonio; Klubicka, Filip
	In: Articles (2018)
	BASE
	Show details

11	Is it worth it? Budget-related evaluation metrics for model selection
	Klubicka, Filip; Salton, Giancarlo; Kelleher, John D.
	In: Conference papers (2018)
	BASE
	Show details

12	hr500k – A Reference Training Corpus of Croatian.
	Erjavec, Tomaž; Ljubešić, Nikola; Klubicka, Filip...
	In: Conference papers (2018)
	BASE
	Show details

13	Croatian Twitter training corpus ReLDI-NormTag-hr 1.1
	Ljubešić, Nikola; Farkaš, Daša; Klubička, Filip. - : Jožef Stefan Institute, 2017
	BASE
	Show details

14	Serbian Twitter training corpus ReLDI-NormTag-sr 1.0
	Ljubešić, Nikola; Farkaš, Daša; Klubička, Filip. - : Jožef Stefan Institute, 2017
	BASE
	Show details

15	Croatian Twitter training corpus ReLDI-NormTag-hr 1.0
	Ljubešić, Nikola; Farkaš, Daša; Klubička, Filip. - : Jožef Stefan Institute, 2017
	BASE
	Show details

16	Serbian Twitter training corpus ReLDI-NormTag-sr 1.1
	Ljubešić, Nikola; Farkaš, Daša; Klubička, Filip. - : Jožef Stefan Institute, 2017
	BASE
	Show details

17	Fine-grained human evaluation of neural versus phrase-based machine translation ...
	Klubička, Filip; Toral, Antonio; Sánchez-Cartagena, Víctor M.. - : arXiv, 2017
	BASE
	Show details

18	Fine-Grained Human Evaluation of Neural Versus Phrase-Based Machine Translation
	Klubička Filip; Toral Antonio; Sánchez-Cartagena Víctor M.
	In: Prague Bulletin of Mathematical Linguistics , Vol 108, Iss 1, Pp 121-132 (2017) (2017)
	BASE
	Show details

19	Serbian-English parallel corpus srenWaC 1.0
	Ljubešić, Nikola; Esplà-Gomis, Miquel; Ortiz Rojas, Sergio. - : Jožef Stefan Institute, 2016
	BASE
	Show details

20	Finnish-English parallel corpus fienWaC 1.0
	Ljubešić, Nikola; Esplà-Gomis, Miquel; Ortiz Rojas, Sergio. - : Jožef Stefan Institute, 2016
	BASE
	Show details

Page: 1 2

© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern